balanced distribution




Transformers Struggle to Learn to Search

Saparov, Abulhair, Pawar, Srushti, Pimpalgaonkar, Shreyas, Joshi, Nitish, Pang, Richard Yuanzhe, Padmakumar, Vishakh, Kazemi, Seyed Mehran, Kim, Najoung, He, He

arXiv.org Artificial Intelligence

Search is a foundational ability in many important tasks, and recent studies have shown that large language models (LLMs) struggle to perform search robustly. It is unknown whether this inability is due to a lack of data, insufficient model parameters, or fundamental limitations of the transformer architecture. In this work, we use the foundational graph connectivity problem as a testbed to generate effectively limitless high-coverage data to train small transformers and test whether they can learn to perform search. We find that, when given the right training distribution, the transformer is able to learn to search. We analyze the algorithm that the transformer has learned through a novel mechanistic interpretability technique that enables us to extract the computation graph from the trained model. We find that for each vertex in the input graph, transformers compute the set of vertices reachable from that vertex. Each layer then progressively expands these sets, allowing the model to search over a number of vertices exponential in the number of layers. However, we find that as the input graph size increases, the transformer has greater difficulty in learning the task. This difficulty is not resolved even as the number of parameters is increased, suggesting that increasing model scale will not lead to robust search abilities. We also find that performing search in-context (i.e., chain-of-thought) does not resolve this inability to learn to search on larger graphs.
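The layer-wise mechanism the abstract describes can be sketched in plain Python (this is an illustration of the exponential reachability-doubling idea, not the circuit extracted in the paper): each "layer" merges every vertex's reachability set with the sets of the vertices it already reaches, so L layers cover paths of length up to 2^L.

```python
def reachable_after_layers(edges, num_layers):
    """edges: set of (u, v) pairs. Returns a dict mapping each vertex to the
    set of vertices reachable from it via paths of length <= 2**num_layers."""
    vertices = {u for e in edges for u in e}
    # Layer 0: each vertex reaches itself and its direct successors.
    reach = {v: {v} | {w for (u, w) in edges if u == v} for v in vertices}
    # Each layer composes the reachability relation with itself,
    # doubling the maximum path length it covers.
    for _ in range(num_layers):
        reach = {v: set().union(*(reach[w] for w in reach[v]))
                 for v in vertices}
    return reach
```

On a path graph 1→2→3→4→5, two such merge steps already connect vertex 1 to vertex 5 (path length 4), whereas zero steps only reach direct neighbors.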


Align, Distill, and Augment Everything All at Once for Imbalanced Semi-Supervised Learning

Aimar, Emanuel Sanchez, Helgesen, Hannah, Felsberg, Michael, Kuhlmann, Marco

arXiv.org Artificial Intelligence

Addressing class imbalance in long-tailed semi-supervised learning (SSL) poses significant challenges stemming from differences between the marginal distributions of the unlabeled and labeled data, as the former is often unknown and potentially distinct from the latter. The first challenge is to avoid biasing the pseudo-labels towards an incorrect distribution, such as that of the labeled data or a balanced distribution, during training. However, we still wish to ensure a balanced unlabeled distribution during inference, which is the second challenge. To address both of these challenges, we propose a three-faceted solution: a flexible distribution alignment that progressively aligns the classifier from a dynamically estimated unlabeled prior towards a balanced distribution, a soft consistency regularization that exploits underconfident pseudo-labels discarded by threshold-based methods, and a schema for expanding the unlabeled set with input data from the labeled partition. This last facet responds to the commonly overlooked fact that disjoint partitions of labeled and unlabeled data prevent the benefits of strong data augmentation on the labeled set. Our overall framework requires no additional training cycles, so it will align, distill, and augment everything all at once (ADALLO). Our extensive evaluations of ADALLO on imbalanced SSL benchmark datasets, including CIFAR10-LT, CIFAR100-LT, and STL10-LT with varying degrees of class imbalance, amount of labeled data, and distribution mismatch, demonstrate significant improvements in the performance of imbalanced SSL under large distribution mismatch, as well as competitiveness with state-of-the-art methods when the labeled and unlabeled data follow the same marginal distribution. Our code will be released upon paper acceptance.
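The "flexible distribution alignment" facet can be sketched as follows. This is our illustrative reading, not ADALLO's exact formulation: pseudo-label probabilities are rescaled from a running estimate of the unlabeled class prior towards a balanced (uniform) target, with the interpolation progressing over training.

```python
import numpy as np

def align_pseudo_labels(probs, est_prior, progress):
    """probs: (N, C) model probabilities on unlabeled data.
    est_prior: (C,) current estimate of the unlabeled class distribution.
    progress: float in [0, 1]; 0 keeps the estimated prior, 1 is fully balanced."""
    num_classes = probs.shape[1]
    balanced = np.full(num_classes, 1.0 / num_classes)
    # Interpolated target distribution for this training step.
    target = (1.0 - progress) * est_prior + progress * balanced
    # Reweight each class by target/estimated prior, then renormalize rows.
    aligned = probs * (target / np.clip(est_prior, 1e-8, None))
    return aligned / aligned.sum(axis=1, keepdims=True)
```

Under this sketch, classes the estimated prior says are over-represented get downweighted in the pseudo-labels, and the effect strengthens as `progress` approaches 1.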


Causal Balancing for Domain Generalization

Wang, Xinyi, Saxon, Michael, Li, Jiachen, Zhang, Hongyang, Zhang, Kun, Wang, William Yang

arXiv.org Artificial Intelligence

While machine learning models rapidly advance the state-of-the-art on various real-world tasks, out-of-domain (OOD) generalization remains a challenging problem given the vulnerability of these models to spurious correlations. We propose a balanced mini-batch sampling strategy to transform a biased data distribution into a spurious-free balanced distribution, based on the invariance of the underlying causal mechanisms for the data generation process. We argue that the Bayes optimal classifiers trained on such a balanced distribution are minimax optimal across a diverse enough environment space. We also provide an identifiability guarantee for the latent variable model of the proposed data generation process, when utilizing enough training environments. Experiments are conducted on DomainBed, demonstrating empirically that our method obtains the best performance across 20 baselines reported on the benchmark.
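A minimal sketch of balanced mini-batch sampling in the spirit described above (our illustration, not the paper's exact procedure): draw an equal number of examples per class so that each mini-batch follows a class-balanced distribution regardless of the skew in the full dataset.

```python
import random
from collections import defaultdict

def balanced_minibatch(dataset, per_class, rng=random):
    """dataset: list of (x, label) pairs. Returns a mini-batch containing
    exactly `per_class` examples (drawn with replacement) from every class."""
    by_class = defaultdict(list)
    for x, y in dataset:
        by_class[y].append((x, y))
    batch = []
    for y in sorted(by_class):
        batch.extend(rng.choices(by_class[y], k=per_class))
    rng.shuffle(batch)  # avoid class-ordered batches
    return batch
```

Sampling with replacement lets minority classes appear as often as majority classes even when they have far fewer examples, which is what turns a biased training distribution into a balanced one at the batch level.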


Finding and Fixing Spurious Patterns with Explanations

Plumb, Gregory, Ribeiro, Marco Tulio, Talwalkar, Ameet

arXiv.org Artificial Intelligence

Image classifiers often use spurious patterns, such as relying on the presence of a person to detect a tennis racket, which do not generalize. In this work, we present an end-to-end pipeline for identifying and mitigating spurious patterns for such models, under the assumption that we have access to pixel-wise object annotations. We start by identifying patterns such as "the model's prediction for tennis racket changes 63% of the time if we hide the people." Then, if a pattern is spurious, we mitigate it via a novel form of data augmentation. We demonstrate that our method identifies a diverse set of spurious patterns and that it mitigates them by producing a model that is both more accurate on a distribution where the spurious pattern is not helpful and more robust to distribution shift.
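The identification step can be sketched with a toy example (our illustration, not the paper's pipeline; `predict`, `hide_people`, and the toy data are hypothetical): remove a candidate object from each image using its annotation and measure how often the prediction for the target label flips, as in "the prediction for tennis racket changes 63% of the time if we hide the people."

```python
def flip_rate(predict, images, hide, target_label):
    """predict: image -> label; hide: removes the candidate object from an
    image. Returns the fraction of images whose prediction for
    `target_label` changes once the object is hidden."""
    flips = sum(
        (predict(img) == target_label) != (predict(hide(img)) == target_label)
        for img in images
    )
    return flips / len(images)

# Toy "spurious" classifier over images represented as sets of objects:
# it predicts "racket" whenever a person OR a racket is present.
images = [{"person", "racket"}, {"person"}, {"racket"}, {"court"}]
predict = lambda img: "racket" if {"person", "racket"} & img else "other"
hide_people = lambda img: img - {"person"}
```

In this toy setup, hiding the people flips the "racket" prediction only on the person-without-racket image, so the flip rate is 1/4; a high flip rate flags the person-to-racket shortcut as a candidate spurious pattern.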


Robot-analysts make BETTER stock recommendations than human investors, study finds

Daily Mail - Science & tech

Robots are expected to take over some 200,000 jobs on Wall Street over the next decade, and a new study suggests this prediction could soon become a reality. Following the analysis of 76,000 reports from seven different robo-analysis firms, researchers determined that the technology is able to make recommendations similar to those of its human counterparts - but faster and more accurately. Because the automation is less subject to behavioral biases and conflicts of interest, it can produce a more balanced distribution of ratings, which includes an investment's risk and suggestions on whether to hold, sell or buy. Looking at the robot portfolios, the study found their buy recommendations earned returns from 6.4 percent to 6.9 percent, while those of their human counterparts only ranged from 1.2 percent to 1.7 percent. Although robo-analysis sounds like it could weed out human investors, researchers believe that as long as there are people that need human interaction, 'the buy-side, the sell-side will still be around.' The study was conducted by a team at Indiana University, who wrote: 'Our study provides the first comprehensive analysis of the properties of investment recommendations generated by "Robo-Analysts," which are human-analyst assisted computer programs conducting automated research analysis.'


Mario Schlechter on LinkedIn: "Yesterday we showed you that we embrace the #future #mobility. Today I would like to invite you to a free training on Operide, our micro-mobility fleet management application based on #ai! So you can make sure that you provide a more balanced distribution of #eBikes or even #eScooters! Just because #itsyourcity! Join our free training!"

#artificialintelligence

Operide, our #ai driven shared micro-mobility fleet management application, optimises the rebalancing process so that more assets (bikes/scooters) are available to the end-user.